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Abstract 

Objective To determine comparative safety and effectiveness of 
combinations of bearing surfaces of hip implants. 

Design Systematic review of clinical trials, observational studies, and 
registries. 

Data sources Medline, Embase, Cochrane Controlled Trials Register, 
reference lists of articles, annual reports of major registries, summaries 
of safety and effectiveness for pre-market application and mandated 
post-market studies at the United States Food and Drug Administration. 

Study seiection Criteria for inclusion were comparative studies in adults 
reporting information for various combinations of bearings (such as metal 
on metal and ceramic on ceramic). Data search, abstraction, and 
analyses were independently performed and confirmed by at least two 
authors. Qualitative data syntheses were performed. 

Resuits There were 3139 patients and 3404 hips enrolled in 18 
comparative studies and over 830 000 operations in national registries. 
The mean age range in the trials was 42-71 , and 26-88% were women. 
Disease specific functional outcomes and general quality of life scores 
were no different or they favoured patients receiving metal on 
polyethylene rather than metal on metal in the trials. While one clinical 
study reported fewer dislocations associated with metal on metal 
implants, in the three largest national registries there was evidence of 
higher rates of implant revision associated with metal on metal implants 
compared with metal on polyethylene. One trial reported fewer revisions 
with ceramic on ceramic compared with metal on polyethylene implants, 
but data from national registries did not support this finding. 



Conciusions There is limited evidence regarding comparative 
effectiveness of various hip implant bearings. Results do not indicate 
any advantage for metal on metal or ceramic on ceramic implants 
compared with traditional metal on polyethylene or ceramic on 
polyethylene bearings. 

Introduction 

Every year over 700 000 joint replacements are performed in 
the Unites States alone,' of which over 270 000 are hip 
replacements. The annual volume of hip joint replacement is 
projected to double over the next decade. Moreover, these 
surgeries are expected to become more expensive with total 
costs tripling in just five years." While joint replacement is a 
successful operation and deals with a great public health burden, 
substantial numbers of patients who receive hip implants require 
revision surgery within 10 years to replace the implant because 
of infection, dislocation, wear, instabihty, loosening, or other 
mechanical failures.' The bearing/articulating surface is 
designed to endure the contact stress and is one of the key design 
factors to reduce the complications and the chance of revision. 
Hip implants with metal femoral heads with polyethylene cups 
as articulating surfaces are associated with low rates of revision.' 
National registries throughout the world continue reporting low 
risk of revision with this traditional bearing.*"'" On the other 
hand, rapid growth of technology introduced several alternative 
bearings to the market that aimed to further reduce implant wear 
and subsequently the time to revision surgery. These alternatives 
include metal on metal and ceramic on ceramic bearings." 
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Metal on metal bearings are particularly attractive to surgeons 
as they allow use of larger femoral heads (>32 mm v <32 mm) 
and supposedly reduce the risk of dislocation and improve the 
functional outcomes in younger patients.'" " They were quickly 
adopted by surgeons and often used even in older patients. In 
one study, one out of three older patients undergoing hip surgery 
received metal on metal hip implants." Recently, however, the 
United Kingdom regulatory agency (Medicines and Healthcare 
Products Regulatory Agency) alerted the public about severe 
cases of metallosis (accumulation of metal ions in the tissues) 
related to the release of metal ions from the implants, ^ and the 
British Orthopaedic Association developed specific 
recommendations." Furthermore, in August 2010 Johnson and 
Johnson recalled over 93 000 metal on metal implants called 
"ASR."'^^ The recall received widespread coverage in the New 
York Times," and the British MedicalJournal published papers 
related to device regulation, clinician involvement, and the need 
to develop evidence."" "* 

In the US, the Food and Drug Administration has been closely 
monitoring reports of failure of hip implants related to various 
bearing surfaces and, in November 2009, initiated a 
comprehensive evaluation and synthesis of evidence of reported 
outcomes for approved implants. We systematically reviewed 
the evidence to determine the short and long term outcomes 
reported by patients undergoing hip replacement and the rates 
of revision after use of implants with various bearings. 

Methods 

Identification of studies 

We worked with the Food and Drug Administration to identify 
the summaries of safety and effectiveness for all pre-market 
apphcation trials and relevant Food and Drug Administration 
mandated post-market studies reporting comparative information 
for hip implants. Summaries of safety and effectiveness of all 
ceramic on ceramic hip implants are publicly available as these 
were the only pre-market application hip replacement devices. 
We then identified all relevant publications related to pre-market 
apphcation trials in Medhne, Embase, and the Cochrane 
Controlled Trials Register to learn about long term results (any 
follow-up) reported. Additionally, we performed a 
comprehensive search of all other publications in Medline, 
Embase, and the Cochrane Controlled Trials Register from 
January 1995 to June 201 1. We used all MeSH terms 
corresponding to hip bearing surface as well as any text words 
that were applicable to locate the studies. The systematic search 
strategy is outlined in appendix 1 on bmj.com. We also searched 
the reference Usts of trials and reviews for additional studies. 
Finally, we reviewed the online annual reports of all registries 
(within and outside the US) that report information from their 
registry (see appendix 2 on bmj.com for details). 

We hmited our study to papers in English. Our inclusion criteria 
were clinical studies, only adults enrolled, reporting any one of 
the clinical outcomes of interest (any functional outcomes or 
revisions, or both), and conventional hip replacement. 

Abstraction of data 

The data were independently abstracted and checked multiple 
times by two abstractors (SD, SJ). The senior author (AS) 
resolved any discrepancies. Both SD and SJ were trained by 
using a large learning sample at the stage of abstract screening 
and manuscript review. One of the senior investigators (AS) 
reviewed a 20% random sample of excluded studies and all 
included abstracts for quality control. The process led to 100% 
agreement among the three authors. 



We abstracted information on study design, quality, number of 
hips (patients), date of procedure, age, sex, number of surgeons 
and centres, diagnosis (osteoarthritis or avascular necrosis), 
mean follow-up, percentage follow-up, and information on the 
manufacturer. The main outcome measures included any 
functional outcome and occurrence of revision. 

Methodological quality 

To evaluate quality of included studies we used four important 
criteria that lead to conclusions of overall perception of the risk 
of bias. As masking/blinding of surgeons in these trials is not 
applicable, we considered "randomised" study description; 
description of a correct randomisation procedure; masking of 
patients and outcome assessors; intention to treat analysis; and 
allocation concealment (methods such as sealed envelope, 
central telephone) including the time of the announcement of 
the allocation (in the operating room v before). We used the 
STROBE criteria to assess the quality of the observational 
studies.''' Quality was classified as high (low risk of bias), 
intermediate (moderate risk of bias), and low (high risk of bias). 
No formal tool or score was developed. The high quahty 
classification was based on adequate description of 
randomisation or comparator group, units of randomisation 
(patient v hip), masking (participants, outcome assessors), and 
allocation concealment. If studies did not report any of the 
criteria and described potentially problematic designs (such as 
questionable exclusion of patients, not describing the group 
balance regarding prognostic factors), we categorised these 
studies as low quality. Other classifications (moderate, moderate 
to high, moderate to low) were based on overall chnical and 
critical judgments of proximity to high or low quality. Thus, 
the decision to classify studies into the three categories was 
based on overall perception of risk of bias after evaluation of 
the quality criteria, descriptions related to design, and 
presentations of data. 

Publication bias 

While pubhcation bias is applicable to any topic area, we believe 
that comparative surgical trials in orthopaedics are relatively 
rare, and papers for such comparisons are likely to be submitted 
and accepted by journals regardless of the findings. Another 
possible publication bias might be related to selective reporting 
of outcomes. Many studies did not report all outcomes of interest 
to us (particularly revision surgery). 

Qualitative and quantitative analyses 

We evaluated chnical heterogeneity (diversity) among the trials 
by abstracting data on included populations and evaluating 
definitions of outcomes (often not reported). Functional 
outcomes included disease specific Harris hip score and general 
quality of life measures (SF-12). We collected data on number 
of events (categorical data), scores (continuous data), relative 
risks (when reported) and their 95% confidence intervals, and 
P values. We report evidence for alternative bearings (such as 
metal on metal and ceramic on ceramic) compared with 
traditional bearings (such as ceramic on polyethylene and metal 
on polyethylene) bearings. We also report any comparative 
evidence between two traditional bearings (such as ceramic on 
polyethylene versus metal on polyethylene) and between 
alternative bearings (ceramic on ceramic versus metal on metal). 
Formal meta-analysis was considered in one instance of 
relatively complete reporting of functional outcome. The results 
from each study are expressed as a mean difference with 95% 
confidence intervals and combined with a random effect method. 
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Statistical heterogeneity was evaluated with I" estimates. Rev 
Man 5.1 software was used for statistical analyses. 

Results 

Studies and populations 

We identified and reviewed 3254 abstracts (see appendix 3 on 
bmj.com). We identified 18 randomised trials and comparative 
observational studies.'"'" One study (Capello and colleagues) 
included both randomised and non-randomised arms and is 
represented twice in the tables. ^'"^ " There were 3139 patients 
and 3404 hips enrolled in 18 comparative studies (table 1)||. 
The mean age ranged from 42 to 7 1, and only five out of 18 
comparative studies reported standard deviation. The proportion 
of women was 26-88%; four studies did not report information 
on sex.^' None of the studies reported age specific or sex 
specific analyses, and none reported race or patients' 
comorbidity, such as diabetes and obesity. Only 13 reported 
information on diagnosis, and osteoarthritis was the most 
prevalent diagnosis in 12 studies (range 41-100%). One study 
predominantly enrolled patients with avascular necrosis,''' which 
was the second most common diagnosis in the trials. Reported 
outcomes included functional outcomes such as disease specific 
Harris hip score, general quality of life measure (such as SF- 12), 
and revision surgery. Follow-up ranged from three months to 
8.1 years (table 1)J|. 

Methodological quality 

Four studies were classified as moderate to high quality, five 
studies as moderate quality, and six studies as low quality. 
Subgroup analyses based on quality were not informative 
because of homogenous results and relatively few studies 
contributing to each comparison. We report the quality 
information to illustrate the overall lack of appreciation of design 
and analysis reporting in the comparative studies of bearing 
surface. 

Patients' outcomes: Harris hip score 

The Harris hip score (with higher scores indicating better 
outcomes) was the most commonly used functional outcome 
measurement and was reported by 16 out of 18 studies (table 
2JJ,). Only 10 studies, however, reported both baseline and 
follow-up measurements; six reported only postoperative scores. 

Metal on metal v traditional bearing surfaces — All studies had 
follow-up measurements, but one study did not report 
preoperative scores (table 2).JJ Four studies compared metal on 
metal bearing with metal on polyethylene at two years. The 
reporting was relatively complete in these studies, and the scores 
were combined. The combined estimate favoured metal on 
polyethylene bearing. There was a significant 2.4 points higher 
score (95% confidence interval 4.5 to 0.3) associated with metal 
on polyethylene compared with metal on metal bearing (fig 1 \\). 
Three studies reported Harris hip scores beyond two years: two 
studies compared metal on metal with metal on polyethylene 
and another with ceramic on polyethylene (fig 2||). The 
individual and combined scores were similar in all three studies 
for all bearings. 

Ceramic on ceramic v traditional bearing surfaces — Five 
studies compared ceramic on ceramic bearings with ceramic on 
polyethylene bearing. Four studies had both baseline and 
follow-up measurements, and one study had measurements only 
after surgery. Two studies compared ceramic on ceramic with 
metal on polyethylene bearing and had measurements only after 
surgery. One of the studies was part of the pre-market 



application process and had a randomised and non-randomised 
arm (and hence is represented twice in table 2).™"'" " The Harris 
hip scores were similar at baseline and follow-up in all groups 
that compared ceramic on ceramic with other bearing surfaces. 

Ceramic on polyethylene v metal on polyethylene — Two studies 
compared ceramic on polyethylene with metal on polyethylene, 
and one of the studies had only follow-up measurement. There 
were no differences in Harris hip scores. 

Metal on metal v ceramic on ceramic — One study compared 
both alternative bearing surfaces and did not find any difference 
in scores at follow-up."" The study included only 28 hips 
(patients) and was likely to be underpowered to determine any 
differences between the groups. 

Patients' outcomes: other functional scores 

Five studies compared metal on metal with other bearing 
surfaces (table 3JJ.). Four of these compared metal on metal with 
metal on polyethylene and one study compared metal on metal 
with ceramic on ceramic. One study reported substantially better 
physical functioning after surgery associated with metal on 
polyethylene compared with metal on metal.^* 

One study compared ceramic on ceramic with ceramic on 
polyethylene, and there was no difference between the groups. 

Patients' outcomes: revision occurrence after 
hip replacement 

Evidence from comparative studies 

Ten comparative evaluations reported information on revisions, 
and only seven reported data on dislocations. No study 
specifically reported aseptic loosening. 

Metal on metal v traditional bearing surfaces — Two studies 
compared metal on metal with metal on polyethylene bearing 
and found no difference in occurrence of revisions. Two other 
studies compared metal on metal with ceramic on polyethylene 
bearing, and one of these studies^' reported substantially higher 
occurrence of dislocation in the ceramic on polyethylene group. 
Occurrence of dislocation was not different in the other three 
studies. 

Ceramic on ceramic v traditional bearing surfaces — Seven 
comparative studies reported this information. Two studies 
compared ceramic on ceramic with metal on polyethylene 
bearing. As described before, one of the studies (Capello and 
colleagues) included both randomised and non-randomised arms 
(represented twice in table 4\l).^'"~ " This study reported 
substantially lower occurrence of revision in the ceramic on 
ceramic arms compared with metal on polyethylene arm. Five 
studies compared ceramic on ceramic with ceramic on 
polyethylene and found no quahtative or quantitative differences 
between the groups in terms of revision occurrence. 

Ceramic on polyethylene v metal on polyethylene — One 
comparative study compared ceramic on polyethylene with 
metal on polyethylene and found no difference in revision 
occurrence. 

Metal on metal v ceramic on ceramic — There were no 
comparative studies discussing occurrence of revision. 

Emerging evidence on revision from national 
registries 

We reviewed the annual reports of 29 national and regional 
registries. Five national registries'*"'" *' including over 830 000 
surgeries reported information on bearing surface, and one US 
study including over 57 000 surgeries reported data from the 
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Medicare database." Some of the large national registries, such 
as the Swedish registry/' reported information on every implant 
used in the country but did not stratify by bearing surface. 
National registries provided important quantitative data, but 
they varied in their methods of risk adjustment. While we 
considered national registry reports as comparative studies, we 
did not evaluate quality because of lack of information on 
methods. 

Main findings 

Metal on metal was associated with higher occurrence of 
revision compared with metal on polyethylene in the adjusted 
analyses of three national registries: Australian, New Zealand, 
and England and Wales National Registries (including over 720 
000 patients).*"'" Three other smaller registries (including the 
Center for Medicare and Medicaid Services (CMS) database) 
did not report such results (fig 311). 

Ceramic on ceramic was associated with a higher occurrence 
of revision than metal on polyethylene in the adjusted analyses 
of the New Zealand National Registry.'' Five registries (including 
the CMS database) did not report any difference. 

Ceramic on polyethylene was associated with a higher 
occurrence of revision than metal on polyethylene in the New 
Zealand National Registry but a lower occurrence in the England 
and Wales National Registry. Three other national registries did 
not report any difference (ceramic on polyethylene implant 
category was not available in the CMS database). 

Discussion 

Functional outcomes, traditionally thought of as primary 
effectiveness outcomes, were no different among patients 
receiving hip replacements with various bearings. We conducted 
this systematic evidence review as part of the new project 
initiated by the Food and Drug Administration to create a 
framework for post-market evaluation of orthopaedic implants. 
This systematic evaluation helps to deal with evidence gaps in 
comparisons of hip bearing surfaces. 

We found some evidence for lower Harris hip scores (by 2.4 
points) at two year follow-up for metal on metal bearing 
compared with metal on polyethylene. There was also evidence 
from one trial that patients receiving metal on metal implants 
have a lower quality of life (functional component) than patients 
receiving metal on polyethylene bearing."* Though these 
differences are small and might not be clinically relevant, they 
are still valuable findings to be considered for future hypothesis 
generation. 

Evidence on implant revision did not favour metal on metal 
implants. While one study reported fewer dislocations associated 
with metal on metal implants, three other trials did not support 
this finding, and the three largest national registries reported 
substantially higher occurrence of revision associated with metal 
on metal implants compared with metal on polyethylene bearing 
(fig 2)^. 

One medium size comparative clinical trial reported a 
substantially lower rate of revision associated with ceramic on 
ceramic implants compared with metal on polyethylene 
implants.''" This conflicts with results reported in national 
registries (fig 2J|). Five national data sources/registries did not 
report any difference between the ceramic on ceramic and 
traditional bearings, and a large registry report from New 
Zealand favoured ceramic on polyethylene (traditional bearing) 
compared with ceramic on ceramic surfaces. 



Comparison with other studies 

To our knowledge this is the first systematic appraisal of 
assessment of clinical outcomes after hip replacement with 
various bearing surfaces. We did not summarise the evidence 
related to metal sensitivity or toxicity, but our findings related 
to clinical outcomes of metal on metal implants are augmented 
by reports that show severe metallosis associated with such 
implants." Concerns regarding the effects of metal ions have 
been noted after a landmark paper by MacDonald et al^'' that 
noted a 5.3-fold increase in erythrocyte cobalt, a 35-fold increase 
in urinary cobalt, and a 17-fold increase in urinary chromium 
concentrations in patients who received metal on metal hip 
replacements compared with those who received metal on 
polyethylene. 

Strengths and limitations of study 

We found 18 comparative studies of bearing surface and six 
national data source/registry reports and comprehensively 
summarised the strengths and limitations of current data. The 
overall quality of reporting in the comparative studies was less 
than adequate in the trials of bearing surface. Many investigators 
did not report their power to detect minimally important 
differences among groups and often omitted standard deviations 
and range of scores observed. In addition, few studies were 
adequately powered or reported subgroup analyses by age, sex, 
underlying aetiology, or femoral head size. Classification of 
polyethylene inserts used in hip replacement also needs 
harmonisation; the various terms used to define cross linked 
polyethylene hinder subgroup analyses. Our review was hmited 
to English language publications, and there is chance of 
publication bias. There was substantial heterogeneity in registry 
evidence. Formal data pooling from registries will require 
harmonisation of data definitions and analytical methods. 

Policy implications 

Changing technology, the need to have large numbers of 
patients, long term follow-up to establish the safety, and short 
and long term outcomes reported by patients to establish the 
effectiveness all lead to a lack of strong evidence in 
orthopaedics. For example, patients who receive hip implants 
often require revision surgery within 10 years, when the implants 
fail because of infection, dislocation, wear, instability, loosening, 
or other mechanical failures. In addition, cUnical trials assessing 
bearing surfaces are conducted in relatively unique environments 
defined by skilled surgeons operating at high volume centres 
and often concentrating on clinical and radiological indirect 
measures of device safety and effectiveness. In general this is 
a reflection of the current state of research in implantable 
devices, where comparative trials are rare and often not 
applicable."* 

Registries are likely to fill the evidence gap in the immediate 
future. Large registries or networks of registries capturing 
various orthopaedic devices are particularly important for 
evaluation of comparative outcomes and active surveillance. 
Only large longitudinal multinational registries can provide 
denominator data for adverse events related to specific implants 
and allow proper conduct of comparative safety and 
effectiveness studies, particularly for rare end points. Current 
registry annual reports that include comparative evaluations, 
however, are not harmonised in terms of methods and reporting. 
Accordingly, the findings are tentative. To deal with this 
limitation the US Food and Drug Administration Center for 
Devices and Radiologic Health started an important initiative 
called the International Consortium of Orthopaedic Registries, 
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which aims to build the foundations for a worldwide research 
consortium of orthopaedic registries. The consortium represents 
more than 15 countries that have existing registries with a 
mission to improve the safety and effectiveness of orthopaedic 
devices and procedures through collaboration. Currently, these 
international registries included more than 3 500 000 orthopaedic 
surgeries capturing all implantable devices on the market. 

Conclusions 

There is limited evidence regarding comparative effectiveness 
of various hip implant bearings, and the results do not indicate 
any advantage for metal on metal or ceramic on ceramic 
implants compared with traditional bearings. A large and high 
quahty randomised controlled trial of bearing surfaces in total 
hip replacement needs to be conducted before any claims of 
benefit are made. Until then national registries provide important 
real world data that are critical for the safety and future 
comparative safety and effectiveness evaluation. 
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What is already known on this topic 

There have been severe cases of accumulation of metal ions in tissues of patients with metal on metal hip implants 

Metal on metal and ceramic on ceramic hip implants might not be associated with any advantage compared with traditional bearings 
such as metal on polyethylene or ceramic on polyethylene 

What this study adds 

Disease specific functional outcomes such as Harris hip score and general quality of life measures were no different between patients 
with metal on metal or ceramic on ceramic hip implants compared with traditional hip implants 

There is some evidence of higher rates of revision surgery associated with metal on metal implants compared with metal on polyethylene 
implants 

While an investigational device exemption trial reported fewer revisions associated with ceramic on ceramic implants, the emerging 
evidence from national registries does not support these results 



Tables 



Table 1 1 Baseline data in comparative studies of various bearing surfaces in hip replacement systems 











Mean (SD, 




% 










No of 






range) 




osteo-arthritis 


Mean 








Study hips 


Date of 


Site, surgeon, 


age 


% 


(avascular 


follow-up 


(%) 




Study 


Design quality (patients) 


procedure 


centre 


(years) 


women 


necrosis) 


(years) 


follow-up 


Manufacturer 


Metal on polyethylene v metal on metal 


Dahlstrand, 


Randomised Moderate 54 (54) 


Oct 


Single 


65 (NA, 


50 


100 (0) 


24 months 


100 


Zimmer (previously 


2009, 




1 998-Jan 


surgeon. 


40-75) 






(22-27) 




Sulzer) 


Sweden'^' 




2001 


single centre 














Engh, 2009, 


Randomised Moderate 91 (91) 


Oct 


NA 


NA (40-80) 


NA 


NA 


NA (6 


86 


DePuy 


US'" 




2003-Oct 










months-2 




Orthopaedics 






2005 










years) 






Jacobs, 2004, 


Randomised Moderate 171 (171) 


March 


Multi-surgeon, 


54.4 (NA) 


43 


66 (20) 


43 months 


72 


Johnson & 


US'" 




1997-July 


multicentre 












Johnson 






2000 
















MacDonald, 


Randomised Moderate 41 (41) 


March 


Multi-surgeon, 


NA 


NA 


NA 


3.2 


98 


BioMet 


2003, US'° 




1998- Nov 


multicentre 








(2.2-3.9) 










1999 
















Zijistra, 2009, 


Randomised Moderate 200(195) 


NA 


Multi-surgeon, 


71 (8, NA) 


79 


NA 


5.6 


97 


Biomet 


Netherlands''' 






multicentre 








(4.2-7.1) 






Ceramic on polyethylene v metal on metal 


Clarke, 2003, 


Comparative Moderate 249 (229) 


C-P 1995-8; 


Single 


56.1 (NA, 


51 


100 (0) 


3 months 


100 


Johnson & 






M-M 


surgeon. 


24-69) 






(NA) 




Johnson 






1998-2002 


single centre 














Dorr, 2004, 


Randomised Low 615(561) 


Aug 1999 


Multi-surgeon, 


60 (NA, 


NA 


NA 


5 (NA) 


NA 


Zimmer (previously 


US" 




start 


multicentre 


27-85) 










Sulzer) 


Ceramic on ceramic v metal on polyethylene 


Bascarevic, 


Randomised Moderate 157(150) 


Jan 


Multi-surgeon, 


54.8 (6.8) 


26 


84(5) 


51 months 




Zimmer 


2010, Serbia"' 


to high 


2003-April 


single centre 


















2008 
















Capello, 


Randomised Moderate 289 (278) 


Oct 


Multi-surgeon, 


53 (NA) 


36 


82 (15) 


8 (5-NA) 


93 


Stryker 


2008,'° 


to high 


1996-Oct 


multicentre 












Orthopaedics 


D'Antonio, 




1998 
















2005, US" 




















Capello 


Comparative Moderate 186(174) 


Sep 


Multi-surgeon, 


51 .8 (NA) 


31 


NA 


6.5 (5-NA) 


NA 


Stryker 


"partial," 


to high 


1999-Sep 


multicentre 












Orthopaedics 


2008, addition 




2000 
















of 4th system. 




















Trident, US=° 




















Ceramic on ceramic v ceramic on polyethylene 


Hamilton, 


Randomised Moderate 264 (263) 


2003 


Multi-surgeon, 


56.7 (NA, 


48 


88 (6) 


31 months 


88 


CeramTec, DePuy, 


2010, US'" 


to high 




multicentre 


20-75) 






(21-49 




DePuy-Johnson & 
















months) 




Johnson 


Kim, 2009, 


Randomised Moderate 200(100) 


Jan 2000- 


Single 


45.3 (NA, 


34 


10 (63) 


5.6 (5-7) 


100 


DePuy 


South Korea'' 


to high* 


Jan 2002 


surgeon. 


21-49) 


















single centre 
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Table 1 (continued) 



Study 


Design 


Study 
quality 


No of 
hips 
(patients) 


Date of 
procedure 


Site, surgeon, 
centre 


Mean (SD, 
range) 

age 
fvears) 


% 
womsn 


% 

osteo-arthritis 
(avascular 
necrosis) 


Mean 
follow-up 


(%) 
follow-up 


Manufacturer 


Lewis, 2010, 
Canada'' 


Randomised 


Moderate 


56 (55) 


Oct 

1997-Oct 
1999 


1-2 surgeons, 
single centre 


42.1 (8.0) 


NA 


41 (11) 


96 months 
(1-10) 


93 


Wright Medical 


Lombardi, 
2010, US=^' 


Randomised 


Low 


89 (80) 


June 2000 


Single 
surgeon 


58 (NA) 


46 


84 (9) 


6.1 

(2.1-9.0) 


NA 


BioMet 


Ochs, 2007 
Germany'''^ 


Randomised 


Moderate 
to high 


66 (66) 


1997-9 


Multi-surgeon, 
single centre 


58.7 (7.8, 
NA) 


42 


53 (NA) 


8.1 

(7.1-9.2) 


65 


Braun-Aesculap, 
CeramTec 


Poggie,2007, 

US"' 


Randomised 


Low 


472 (429) 


Oct 1999- 
Jan 2003 


Muiti-surgeon 


54 (12, 
23-76) 


45 


77(17) 


2 


99 


Zimmer/CeramTec 


Ceramic on polyethylene v metal on polyethylene 


Kawate, 
2009, Japan" 


Randomised 


Low 


62 (60) 


July 

2001 -July 
2003 


Multi-surgeon 


61.1 (NA, 
30-78) 


88 


69 (18) 


5 (4.8-5.4) 


100 


Japan Medical 
Material 


von 

Schewelov, 

2005, 

Sweden" 


Randomised 


Low 


114 (114) 


1996-8 


Muiti-surgeon, 
multicentre 


64 (NA, 
50-76) 


54 


NA 


5(NA) 


92 


DePuy, 

DePuy-Johnson & 
Johnson 


Ceramic on ceramic v metal on metal 


Grubl, 2006, 
Austria" 


Randomised 


Low 


28 (28) 


Nov 

2000-Feb 
2002 


NA 


62.2 (NA, 
32-81) 


64 


93 (7) 


1 (NA) 


NA 


Zimmer, CeramTec 



NA=not available (not applicable or not provided). 

*ln this study each person served as his/her own control (bilateral procedures). Functional outcomes are challenging to determine but potentially possible. Uncertainty 
about surgeon's involvement in functional outcome measurement leads to less than perfect quality classification. 
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Table ''I Harris hip score in comparative studies of bearing surfaces. Figures are mean scores (higher scores indicating better outcomes) 
(SD and range when available) for longest period of follow-up reported 





Metal on metal 


Ceramic on ceramic 


Metal on polyethylene 


Ceramic on polyethylene 


Study 


Preoperative Postoperative 


Preoperative Postoperative 


Preoperative Postoperative 


Preoperative Postoperative 


Dahlstrand, 2009*, 
Sweden"' 


38.1 (10.9) 


92.8 (12.9) 


NA 


NA 


38.9 (12.7) 


95.9 (4.8) 


NA 


NA 


Engh, 2009, US"= 


Done but NA 


93.5 (9) 


NA 


NA 


Done but NA 


96 (6) 


NA 


NA 


Jacobs, 2004, US" 


42 


95.4 (65-100) 


NA 


NA 


43 


96.1 (65-100) 


NA 


NA 


MacDonald, 2003, 


46.5 (13.4) 


91.6 (11.5) 


NA 


NA 


46.6 (12.4) 


89.2 (12.5) 


NA 


NA 


Zijistra, 2009t, 
Netherlands'^' 


48 (15) 


90 (7) 


NA 


NA 


46 (13) 


87 (13) 


NA 


NA 


Dorr, 2004, US'" 


NA 


93.5 (12, 
14-100) 


NA 


NA 


NA 


NA 


NA 


93.5 (11, 
41-100) 


Bascarevic, 2010, 
Serbia== 


NA 


NA 


NA 


95.1 


NA 


93.8 


NA 


NA 


Capello, 2008,=° 
D'Antonio, 2005,°' 
US 


NA 


NA 


NA 


96.35 


NA 


96.4 


NA 


NA 


Capello, 2008, 
addition of 4th 
system. Trident, 
US°° 


NA 


NA 


NA 


95.7 


NA 


96.4 


NA 


NA 


Hamiiton, 2010, 
US=^° 


NA 


NA 


50.6 


94.4 


NA 


NA 


50.7 


93.8 


Kim, 2009, South 
Korea°° 


NA 


NA 


39 (6-47) 


93 (85-100) 


NA 


NA 


41 (9-51) 


94 (86-100) 


Lombardi, 2010, 

US=^' 


NA 


NA 


51 (6-68) 


90 (50-100) 


NA 


NA 


48 (6-69) 


92 (49-100) 


Ochs, 2007, 
Germany"'^ 


NA 


NA 


NA 


91 (21.6) 


NA 


NA 


NA 


89 (12.1) 


Poggie, 2007, US'° 


NA 


NA 


45 (14-91) 


92 (36-100) 


NA 


NA 


43 (10-78) 


93 (51-100) 


Kawate, 2009, 
Japan'^^ 


NA 


NA 


NA 


NA 


47 (28-77) 


91(60-100) 


43 (15-75) 


89 (58-100) 


von Schewelov, 
2005, Sweden'' 


NA 


NA 


NA 


NA 


NA 


91.7 (11.2) 


NA 


91.3 (10.5) 


Grubl, 2006, 
Austria'" 


50.4 
(15.1-71.4) 


85.1 (61.1-100) 


58.4 
(24.4-82.3) 


86.6(41.5-100) 


NA 


NA 


NA 


NA 



NA=not available (not applicable or not provided). 

•Significant difference favouring metai on polyethylene compared with metai on metai at 12 months but not at 24 months after surgery. 
fMeasurements at two and five years. Only score at five years reported here (fig 2 shows both two and five year scores). 
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Table ' | Other functional outcomes In comparative studies of various bearing surfaces in hip replacements. Figures are mean (SD) scores 
on each scale 


Metal on metal 


Ceramic on ceramic 


Metal on polyethylene 


Ceramic on polyethylene 


Study Preoperative Postoperative 


Preoperative Postoperative 


Preoperative Postoperative 


Preoperative Postoperative 


Dahlstrand, 2009, Sweden" 


SF-36: PF 32.6 (22.6) 


67.2 (26.9)* 


NA 


NA 


40 (24.5) 


77 (19.3)* 


NA 


NA 


SF-36:BP 33.3(19.4) 


67.9 (31.7)* 


NA 


NA 


39.3 (19.2) 


76.7 (27.9)* 


NA 


NA 


Engh, 2009, US"= 


WOMAC NA 


88 (15) 


NA 


NA 


NA 


89 (19) 


NA 


NA 


PCS NA 
(SF-12) 


46.2 (11.6)t 


NA 


NA 


NA 


51 (9)t 


NA 


NA 


MCS NA 
(SF-12) 


56.8 (7)t 


NA 


NA 


NA 


55 m 


NA 


NA 


MacDonald, 2003, US'" 


WOMAC 58.7(15.3) 


17.3 (15.1 ) 


NA 


NA 


59.1 (14.5) 


19.9 (18.9) 


NA 


NA 


PCS SF-12 32.8(10.2) 


47.2 (9.6) 


NA 


NA 


31.1 (4.7) 


45.5 (11.7) 


NA 


NA 


MCS SF-12 53.5(9.6) 


55.9 (6.2) 


NA 


NA 


53.6 (11.6) 


53.3 (11.7) 


NA 


NA 


Zijistra, 2009, Netherlands'' 


Oxford hip 40 (8) 
score 


19(8) 


NA 


NA 


40 (8) 


18(8) 


NA 


NA 


Lewis, 2010, Canada'' 


St Michael's NA 
score 


NA 


15.8 


22.9 


NA 


NA 


16.2 


22.8 


Grubl, 2006, Austria'* 


UCLA NA 
activity score 


7/1 0 points§ 


NA 


7/10 points§ 


NA 


NA 


NA 


NA 


UCLA=University of California, Los Angeles; PF=physical function; BP=bodily pain; PCS=physical component score; MCS=mental component score; WOMAC=Western 
Ontario and McMaster Universities activity score; NA=not available (not applicable or not provided). 
*At 24 months. 

tDifference significant (P=0.03). Mean score for metal on metal calculated from two groups (28 mm and 36 mm head sizes). 
itiDifference not significant. Mean score for metal on metal calculated from two groups (28 mm and 36 mm head sizes). 
§At 12 months. 
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Table "] Revision surgery and reported disiocation in comparative studies of various bearing surfaces in hip replacements. Figures are 
numbers (percentages) 




Metal on metal 


Ceramic on ceramic 


Metal on polyethylene 


Ceramic on polyethylene 


Study 


Revision 
surgery 


Reported 
dislocation 


Revision 
surgery 


Reported 
dislocation 


Revision 
surgery 


Reported 
dislocation 


Revision 
surgery 


Reported 
dislocation 


Jacobs, 2004, US*° 


1/95 (1.1) 


1/95 (1.1) 


NA 


NA 


1/76 (1.3) 


1/76 (1.3) 


NA 


NA 


Zijistra, 2009, 
Netherlands'^' 


3/98 (2.9) 


NA 


NA 


NA 


1/92 (1.1) 


NA 


NA 


NA 


Clarke, 2003, UK=' 


NA 


1/109 (0.9)* 


NA 


NA 


NA 


NA 


NA 


9/140 (6.4)* 


Dorr, 2004, US'* 


10/311 (3.2) 


12/304 (3.9) 


NA 


NA 


NA 


NA 


8/304 (2.6) 


12/304 = 3.9) 


Bascarevic, 2010, 
Serbia'" 


NA 


NA 


2/82 (2.4) 


1/82 (1.2) 


2/75 (2.7) 


2/75 (2.7) 


NA 


NA 


Capello, 2008,'° 
D'Antonio, 2005, US" 


NA 


NA 


6/349 (1.7)* 


10/349 (2.9) 


10/165 (6.1)* 


7/165 (4.2) 


NA 


NA 


Capello, 2008, addition 
of 4tfi system. Trident, 
US'" 


NA 


NA 


1/186 (0.5)* 


NA 


10/165 (6.1)* 


7/165 (4.2) 


NA 


NA 


Hamilton, 2010, US'" 


NA 


NA 


4/177 (2.3) 


5/177 (2.8) 


NA 


NA 


2/87 (2.3) 


4/87 (4.6) 


Kim, 2009, Soutti 
Korea'= 


NA 


NA 


NA 


1/100 (1) 


NA 


NA 


NA 


2/100 ( 2) 


Lewis, 2010, Canada'" 


NA 


NA 


1/30 (3.3) 


NA 


NA 


NA 


1/26 (3.8) 


NA 


Lombardi, 2010, US" 


NA 


NA 


3/44 (6.8) 


NA 


NA 


NA 


3/45 (6.7) 


NA 


Poggie, 2007, US" 


NA 


NA 


4/315 (1.3) 


NA 


NA 


NA 


2/157 (1.3) 


NA 


von Schewelov, 2005, 
Sweden" 


NA 


NA 


NA 


NA 


1/58 (1.7) 


NA 


3/56 (5.3) 


NA 


NA= not available (not applicable or not provided). 
•Significant group difference. 



No commercial reuse: See rights and reprints http://www.bmj.com/permissions Subscribe: http://www.bmi.com/subscribe 



e/WJ2011;343:d7434 doi: 10.1136/bmj.d7434 (Published 29 November 2011) 



Page 11 of 12 



RESEARCH 



Figures 



study 



Metal on metal Metal on polyethylene 

Mean(SD) No In Mean(SD) No In 
score sample score sample 



Dahlstrand 2009" 92.8(12.9) 
Engh 2009^" 93.5 (10.0) 

MacDonald 20033"^ 91.6(11.5) 
Zijlstra 2009» 85.0 (9.0) 
Total (95% CI) 

Test for heterogeneity: 1^=0.00, % 



95.9 (4.8) 
96.0 (6.0) 
89.2 (12.5) 
88.0 (12.0) 



28 
57 

23 
75 
183 

=1.79, df=3, P=0.62, 1 



Mean difference 
Onverse variance) 
(random 95% CO 




Weight Mean difference 
(%) (inverse variance) 
(random 95% CQ 



-3.10 (-8.22 to 2.02) 
-2.50 (-5.79 to 0.79) 
2.40 (-5.05 to 9.85) 
-3.00 (-6.42 to 0.42) 
-2.40 (-4.47 to -0.33) 



Test for overall effect: z=2.28, P=0.02 



-20 -10 0 

Favours metal 
on polyethylene 

Fig 1 Harris hip scores at two years (top) after hip implants with metal on metal versus metal on polyethylene bearings. 
Zijlstra et aP^ measured score at both 24 months and 60 months 



Favours metal 
on metal 



Study 



Metal on metal 

Mean(SD) No in 
score sample 



Metal on polyethylene or 
ceramic on polyethylene 

Mean(SD) No in 
score sample 



Metal on metal v metal of polyethylene 

130065 2004™ 95.4 (12.0) 95 96.1 (12.0) 76 

Zijlstra 2009^' 90.0 (7.0) 75 87.0 (13.0) 73 

Total (95% CI) 170 149 

Test for heterogeneity: 12=3.66,-/^=2.1 5, df=l, P=0.14, P=53% 
Test for overall effect: z-0.65, P-0.51 

Metal on metal v ceramic on polyethylene 

Dorr 2004" 93.5 (12) 311 93.5 (11) 304 

Subtotal (95% CI) 311 304 

Test for overall effect: z=0.00, P=1.00 



Mean difference 
Qnverse variance) 
(random 95% CO 




Weight Mean difference 
(%) Onverse variance) 
(random 95% CO 



-0.70 (-4.32 to 2.92) 
3.00 (-0.38 to 6.38) 
1.21 (-2.41 to 4.83) 



54 0.00 (-1.82 to 1.82) 
54 0.00 (-1.82 to 1.82) 



100 0.58 (-1.32 to 2.47) 



Favours metal 
on metal 



Total (95% CI) 481 453 

Test for heterogeneity: 1-^=0.88, X^=2.81, df=2, P=0.24, 1^=29% 
Test for overall effect: 2=0.60, P=0.55 -20 -10 

Favours metal on 

polyethylene or 

ceramic on 

polyethylene 

Fig 2 Harris hip score beyond two years after hip implants with metal on metal versus metal on polyethylene or ceramic on 
polyethylene bearings. Jacobs et al'"' assessed score at mean 43 months while Dorr et aP" assessed score at 60 months. 
Zijlstra et aP^ measured score at both 24 months and 60 months. SD of mean imputed for Jacobs et al and sensitivity 
analyses performed; varying estimates of SD did not change overall estimate of effect 
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Metal on polyethylene 



Australian National Registry: More revisions with metal on metal 
New Zealand National Registry: More revisions with metal on metal 
England and Wales National Registry: More revisions with metal on metal 
No difference in three others 



Australian National Registry: 
More revisions with ceramic on ceramic 
No difference in five others 



Metal on metal 



Australian National Registry: 
More revisions with metal on metal 
No difference in five registries 



New Zealand National Registry: 

More revisions with ceramic on polyethylene 
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Fig 3 Qualitative evidence repoiled in six national data sources/registries (Australia, New Zealand, England and Wales, 
Italy (Emilia-Romagna), Kaiser and CMS databases) 
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